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1 Introduction 



According to the present knowledge of physics, the ultimate constituents of 
matter are quarks and leptons. Both of them are particles of spin 1/2 that 
interact by interchanging spin 1 particles, namely photons, gluons, , W~ 
and The existence of the Higgs spin particle is presently under exper- 
imental investigation. 

The issues of relativity and quantum mechanics, that are strictly necessary 
to understand atomic and subatomic world, have favored the development of 
local field theories in which, as we said, the interactions are mediated by the 
interchange of the (virtual) integer spin particles mentioned above. A general 
feature of these theories is that, in the field Lagrangian or Hamiltonian, the 
interaction term is simply added to the term that represents the free motion 
of the particles. 

As for the free term of the matter, spin 1/2, particles, it gives rise to the 
Dirac equation, that represents the relativistic, quantum mechanical wave 

equation for these particles. 

These arguments explain the great importance of Dirac equation for the 
study of particle physics at fundamental level. However, it is also strictly 
necessary to understand many important aspects of atomic physics, nuclear 
physics and of the phenomenological models for hadronic particles. 

An introduction to this equation represents the objective of the present work 
that is mainly directed to students with good foundations in nonrelativistic 
quantum mechanics and some knowledge of special relativity and classical 
electrodynamics. 

We shall not follow the historical development introduced by Dirac and 
adopted by many textbooks. In that case, the Lorentz transformation (boost) 
of the Dirac spinors is performed only in a second time, without clarifying 
sufficiently the connection between the mathematics and the physical mean- 
ing of that transformation. 

In this paper the Dirac equation will be derived starting from the basic 
principles of special relativity and quantum mechanics, analyzing the trans- 
formation properties of the relativistic spinors. 

This development will be carried out without entering into the mathematical 
details of the Lorentz group theory, but keeping the discussion at a more 
physical level only using the mathematical tools of linear vector algebra, as 
row by column matrix product and vector handling. 
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In our opinion this introductory approach is highly recommendable in order 
to stimulate the students to make independent investigations by using the 
powerful concept of relativistic covariance. 

In a subsequent work we shall analyze in more detail the properties of Dirac 
equation and derive some relevant observable effects. To that work we shall 
also defer an introduction to the field theory formalism that is needed to give 
a complete physical description of subatomic world. 

The subjects of the present work are examined in the following order. 

In Subsection 1.1 we give some tedious but necessary explanations about the 
adopted notation. 

In Section 2 we study some relevant aspects special relativity, focusing our 
attention on the properties of the Lorentz transformations. 
Their fundamental properties are recalled in Subsection 2.1. 
We briefiy analyze, in Subsection 2.2, classical electrodynamics as a relativis- 
tic fields theory. 

In Subsection 2.3 we examine the hyperbolic parametrization of the Lorentz 
transformations, introducing concepts and techniques that are widely applied 
in relativistic quantum mechanics for the construction of the boost operators. 

Lorentz transformations in an arbitrary direction are given in subsection 2.4. 
A very important point of this work is studied in Subsection 2.5, where the 
commutation rules of the Lorentz boost generators, rotation generators and 
parity transformation are derived. 

In Section 3 we make use of the concepts of relativity to lay the foundations 
of relativistic quantum mechanics. 

In Subsection 3.1 we discuss, as an example, the relativistic wave equation 
for a spin particle. 

In Subsection 3.2 we introduce the (quantum-mechanical) Dirac equation for 
spin 1/2 particles, starting from the commutation rules of the boost genera- 
tors, rotation generators and parity transformation. 

The properties of the Dirac Gamma matrices and their different representa- 
tions are examined in Subsection 3.3. 

Some relevant matrix elements of Dirac operators, as 7^, are studied in Sub- 
section 3.4. 
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Finally, plane wave solutions and the corresponding conserved current are 
found and discussed in Subsection 3.5. 

The Appendix is devoted to study some useful properties of the Pauli matri- 
ces. 

1.1 Notations and Conventions 

We suggest the reader to read cursorily this Subsection and to go back to it 
when he finds some difficulty in understanding the other parts of the paper. 
First of all, the space time position of a particle is denoted as = {x'^, r) 
with x^ = ct and r ^). To avoid confusion, we use this last 

notation instead of the standard one, that is {x, y, z). 
Greek letters of the "middle" part of the alphabet, as /x, z/, p, cr, ... running 
from to 3, are used to denote four- vector components. On the other hand 
the letters of the beginning of the Greek alphabet, as a, (3, 5, ... running 
from 1 to 3, denote three-vector components. This last notation with upper 
indices will be used extensively even though the corresponding quantity does 
not make part of a four- vector. 

Repeated indices are always summed, unless otherwise explicitly stated. 
For two three- vectors, say a and b, the scalar product is denoted as 

ab = a'^h'^ 

If one of the two vectors is a set of the three Pauh [a^) or Dirac {a^) , (7'^) 
matrices, we use the notation 

(era) = a^a^, (aa) = a^a\ (7a) = -f^a^ 

Furthermore, the notation V collectively indicates the derivatives with re- 
spect to the three components of the position vector r. 

Lower indices are only used for four-vectors and denote their covariant com- 
ponents as explained just after eq.(2.2). Invariant product of two four- vectors 
is introduced in eq.(2.3). For the unit vectors we use the standard notation 

a 

a = -j— j- 
|a| 

When a four- vector is used as an argument of a field or wave function, the 
Lorentz index /i, u, p, a, ... is dropped and, more simply , we write 

A^{x), ^l;{x) 
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where x represents collectively all the components of the four- vector x^. 
In order to denote products of matrices and four-vectors, we arrange the 
components of a four- vector, say in a column vector, denoted as [x\. The 
corresponding transposed vector [xY is a row vector. Standard Latin letters, 
without indices, are used to denote matrices. See, for example, eq.(2.6). We 
use this notation also for the set of the four Dirac matrices at the end of 
Subsection 3.2. 

Four components Dirac spinors, introduced in Subsection 3.2, are handled 
according to the same rules of vector algebra. They are denoted by a Latin 
letter without parentheses. 

We recall that the hermitic conjugate of the Dirac spinor is a row spinor 
defined as: 

u' = u 

For the commutator of two matrices (or operators), say Q, R, we use the 
notation 

[Q, R]^QR- RQ 
For the anticommutator we use curly brackets 

{Q, R} = QR + RQ 

2 Relativity 

The principle of relativity, that was found by Galilei and Newton, states that 
it is possible to study physical phenomena from different inertial reference 
frames (RF) by means of the same physical laws. The hypothesis of an 
absolute reference frame is not allowed in physics. 

Obviously, one has to transform the result of a measurement performed in 
a reference frame to another reference frame, primarily the measurements of 
time and space. 

Requiring the speed of light c to be independent of the speed of the reference 
frame, as shown by th Michelson-Morley experiment, one obtains the Lorentz 
transformations that represent the formal foundation of Einstein's special 
relativity. The reader can find in ref.[l] a simple and satisfactory development 
of this point. 
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2.1 Fundamental Aspects of Lorentz Transformations 



Considering a RF S' moving at velocity v along the x^-axis with respect to 
S, one has the standard Lorentz transformations 



X 



.10 



X 



/I 



V 

= 7(x° x^) 

c 

7(-^x° + x^) 



X 



12 



X 



/3 



X 



(2.1a) 



where x^ — ct, {x^, x'^, x^) — r and 7 = [1 — [v/cY 



-1/2 



A thorough study of the subject of this Subsection, that consists in general- 
izing the previous equations, can be found in ref.[2]. In the present paper we 
highlight some specific aspects that are relevant for a quantum-mechanical 
description of elementary particles. 



The Lorentz transformations of cq.(2.1a) can be syntetically written as 



X- 



L>',.{v)x^ 



(2.16) 



where the indices /x, v take the values 0, 1, 2, 3 and x^ is denoted as con- 
travariant four-vector. 

By introducing the Minkowsky metric tensor 



1 






00 
10 
0-1 
0-1 



(2.2) 



one can construct covariant four-vectors x^ = g^yX" and invariant quantities 
as products of covariant and contravariant four-vectors. For example, given 
two contravariant four-vectors, say = {s^.s) and = (Z^,!), one can 
construct their covariant counterparts = (s°,— s), = (Z°, — 1) and the 
quantity 

(2.3) 
(2.4) 



07O 



si 



that is invariant under Lorentz transformation: 
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In particular, the Lorentz transformation of eq.(2.1a) is obtained [1,2] by 
requiring the invariancc of the propagation of a spherical light wave, that is 
the invar iance of x^x^ = 0. 
The invariance equation (2.4) requires 

g,,L^ML'M-9^a (2.5) 

In many cases it is very useful to work with standard linear algebra notation. 
Furthermore, at pedagogical level, this technique is very useful to introduce 
standard handling of Dirac spinors. 

Identifying a four- vector with the column vector [x\ , the invariant product 
of eq.(2.4) is written as 

s>'g,J^ - [sYg[l] (2.6) 

where the upper symbol T denotes the operation of transposition. By means 
of this notation, eq.(2.5) reads 

L{v)gL{v) = g (2.7) 

where we have used the important property, directly obtained from eq.(2.1a), 
that L^{v) = L{v). Also, a covariant four-vector is [x^ — g[x]. Its 
transformation is 

[<] = gL{v)[x\ = gL{v)gg[x\ = gL{v)g[x,] (2.8) 
Let us now multiply eq.(2.7) by g from the right, obtaining 

L{v)gL{v)g = 1 (2.9) 

In consequence 

gL{v)g^L-\v) (2.10) 

it means that the covariant four-vectors, look at eq.(2.8)!, transform with the 
inverse Lorentz transformations. By means of direct calculation or by using 
the principle of relativity one finds that 

L-^{v) = L{-v) (2.11) 



We recall some relevant physical quantities that are represented by (i.e. trans- 
form as) a four-vector. As previously discussed, we have the four-position 
(in time and space) of a particle denoted by x^. 
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We now define the four- vector that represents the energy and momentum of 
a particle. 

Previously, we introduce the (invariant) rest mass of the particle. In the 
present work this quantity will be simply denoted as the mass m. We shall 
never make use of the so-called relativistic mass. 
We also define the differential of the proper (invariant) time as 



dr ^ -[dxadx^Y/^ = 
c 



{dtf - -Advf 



1/2 



= dt 



21 1/2 



dt 
7 



(2.12) 



where the velocity 



dx 



represents the standard physical velocity of the particle measured by an ob- 
server in a given reference frame. Furthermore, the factor 7 is a function of 
that velocity, of the form: 



7 = 



V 2- 
l-(-) 

c 



-1/2 



The energy-momentum four- vector is obtained differentiating the four-position 
with respect to the proper time and multiplying the result by the mass m. 
One has 

E dx^ 
— (— , p) — m—— — (mc'j, mv7) (2-13) 
c dr 

In previous equation, E represents the energy of the particle and p its three- 
momentum. More explicitly, the energy is 



E 



For small values of the velocity |v| << c one recovers the nonrelativistic 
limit, that is 



E ~ mc^ -mv^ ... (2.14a) 

p~mv-F... (2.146) 

Note that the energy and momentum of a particle belong to the four-vector 
of eq.(2.13). In consequence, energy and momentum conservation can be 
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written in a manifestly covariant form. For example, in a collision process in 
which one has a transition from an initial state (I) with Nj particles, to a final 
state (F) with Np particles, the total energy and momentum conservation is 
written by means of the following four- vector equality 



that holds in any reference frame. A complete discussion of the physical con- 
sequences of that equation and related matter is given in rcf. [3]. Only recall 
that, at variance with nonrclativistic mechanics, mass is not conserved. In 
general, mass-energy transformations are represented by processes of creation 
and destruction of particles. As a special case, a scattering reaction is defined 
elastic, if all the particles of the final state remain the same (obviously, with 
the same mass) as those of the initial state. 

Four-momentum conservation of eq.(2.15) is a very simple example. In gen- 
eral, a physical law written in a manifestly covariant form automatically 
fulfills the principle of relativity introduced at the beginning of this section. 
A physical law is written in a manifestly covariant form when it is written as 
an equality between two relativistic tensors of the same rank: two Lorentz 
invariants (scalars), two four- vectors, etc.. 

Going back to eq.(2.13) one can construct the following invariant 



The second equality is obtained in the easiest way by calculating the invariant 
in the rest frame of the particle, where = (mc, 0). 

From the previous equation one can construct the Hamiltonian of a particle, 
that is the energy written as function of the momentum 



(2.15) 



i=l i=l 




(2.16) 




(2.17) 



that in the nonrelativistic limit reduces to 



E ~ mc^ + ^ + ... 



2m 
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Note that in eq.(2.17) we have taken only the positive value of the square 
rooth. This choice is perfectly legitimate in a classical context, where the 
energy changes its value in a continous way. On the other hand negative 
energy solutions cannot be discarded when considering quantum-mechanical 
equations. 

From eqs.(2.13) and (2.17), the velocity of a particle is 



In the second relation, the equality is satified by massless particles. The 
constraint on velocity has a more general validity, as we shall see when re- 
vising electromagnetism: everything that carries information cannot have a 
velocity greater than the speed of light c. 

For the study of both classical and quantum-mechanical (field) theories it is 
very important to determine the transformation properties of the derivative 
operator 



The reader is suggested to derive them by using directly the chain rule. We 
propose here a simpler proof. Let us consider the invariant x^x^, = (a;°)^ — 
and apply to it the derivative operator. One has 



That is, the derivative with respect to the contravariant components gives, 
and transforms as , a covariant four- vector {2x^ in the previous equation). 
Conversely, the derivative with respect to the covariant components trans- 
forms as a contravariant four-vector: 




v| < c 




d 



x^x" = 2Xf, = 2(x°, -r) 



(2.18a) 



^x^' 




(2.186) 



For this reason the following notation is introduced 



d 



(2.19a) 



dx^^ 
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and 

d 

Straightforwardly one verifies tliat 



(2.1%) 



' ' (2.20) 



is an invariant operator. 

2.2 Electromagnetism and Relativity 

The elements that have been developed in the preceding Subsection will help 
us to understand the relativistic properties of classical electromagnetism. 
In summary, electromagnetism is a local theory in which the interaction 
between charged particles is carried by the electromagnetic field, at light 
speed c. A complete analysis of this theory can be found, for example, in 
refs.[2,4]. 

With respect to interaction propagation, the reader should realize that New- 
ton's theory of gravitational interaction is not compatible with special rela- 
tivity. In fact the gravitational potential energy 

_ Gmim2 

9 

r 

depends instantaneously on the distance r between the two bodies. If one 
body, say the ^1, changes its position or state, the potential energy, and 
in consequence, the force felt by the body ^2 changes at the same instant, 
implying a transmission of the interaction at infinite velocity. 
Note that, on the other hand, the expression of Coulomb potential energy, 
that is formally analogous to the Newton's gravitational one, holds exactly 
exclusively in the static case. According to classical electromagnetism, if the 
interacting particles are in motion, it represents only approximatively their 
interaction. This approximation is considered good if their relative velocity 
is 

|v| << c 

The fundamental quantity of electromagnetism is the vector potential field 
Ai"^ (A°,A). 



12 



A field is, by definition, a function of tlie time-space position x'^. As done in 
most tcxbooks, in tlie following we shall drop the index v of the argument, 
simply writing A^^ = A^{x). 

Synthetically, we recall that the Maxwell equations have the form 

d^&'A'' = —f (2.21) 

c 

with the Lorentz invariant Gauge condition 

d^A*" = (2.22) 

where we have introduced the current density 

f = (cp(x), j(x)) (2.23) 

Applying the derivative operator to eq.(2.21) and using eq.(2.22), one 
finds the current conservation equation, that is 

d^^f = ^P(^) + = (2.24) 

All the equations written above are manifestly covariant and the Lorentz 
transformations can be easily performed. If a solution of eqs.(2.21) and (2.22) 
is found in a reference frame S, it is not necessary to solve the equations in 
the reference frame S' , but simply one can transform the electromagnetic 
field: 

A'^ix') = L^^{v)A''{x{x')) (2.25) 

In more detail, one has 

(i) to tranform the field mixing its components by means of L^^(w), that 
is the first factor of the previous equation, but also 

(ii) to express the argument x of the frame <S as a function of x' measured in 
S', that is, recalling eq.(2.11) 

x'' = L''p{-v)x'P 
We briefly deflne the last operation as argument re- expression. 

The reader should note that such double transformation occours in the same 
way when a rotation is performed. In this case the space components A 
are mixed by the rotation matrix (for this reason the electromagnetic field 
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is defined as a vector field) and the argument r must be expressed in terms 
of r' by means of the inverse rotation matrix. Under rotation, in the time 
component vl", one only has the argument re-expression of r. 

In principle it is possible to construct a scalar field theory (even though 
there is no evidence of such theories at macroscopic level). In this case the 
field is represented by a one-component function (j){x). Both the Lorentz 
transformation and the rotations only affect the argument x in the same way 
as before, but no mixing can occur for the single component function <p. One 
has only to perform the argument re-expression. 

We shall now explain with a physical relevant example the use of the trans- 
formation (2.25) for the electromagnetic field. 

Let us consider a charged particle moving with velocity u along the x^-axis. 
What is the field produced by this particle ? 

We introduce a reference frame S in which the particle is at rest, while the 
observer in S' sees the particle moving with velocity u along x^. The velocity 
of S' with respect io S is v = —u. The field in S is purely electrostatic, that 
is 

^° = ^°(ct,r) = ^ (2.26a) 
|r| 

A = A(ci,r) = (2.266) 

where q represents the charge of the particle. We find A'^^ by means of 
eq.(2.25). First, one has 

A"" = (2.27a) 

A" = -7>1° (2.276) 
c 

^/a = ^ Q ^2.27c) 

with a = 2, 3 and 7 = [1 - {u/cf]-^/^. 

Now we express |r| in terms of {ct',r'), that is we perform the argument 
re- expression. 

By means of eqs.(2.1) and (2.11) one has 

= -f{-ut' + x'^) 
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so that 

|r| = [^\-ut' + + {xy + {x^Y'^ (2.28) 

By means of the previous equation the final expression for the field of 
eqs.(2.27a,b) is 

A\ct\ r') = q^[^\-ut' + x'^f + {x"-f + (x''^)^]-^^ (2.29a) 

A^\ct\v')^-A"'{ct\v') (2.2%) 
c 

This example has been chosen to explain the procedure for transforming a 
field function. 

The field of eqs.(2.29a,b) can be directly derived in the frame S' by solving 
the Maxwell equations (2. 21), (2. 22) as done in refs.[2,4]. The technique of the 
Lienard Wiechert potentials can be used. But, as the reader should check, 
much more mathematical efforts are required. 

2.3 The Hyperbolic Parametrization of the Lorentz 
Transformations 

Going back to the Lorentz transformations of eq.(2.1) we note that the coef- 
ficients of the of the transformation matrix L{y) are: 

V 

7 , - -7 

c 

Wc square both terms (the minus sign disappears in the second one) and 
take the difference, obtaining 




RecaUing that the hyperbolic functions satisfy the relation 

ch^uj — sh^uj = 1 
one can choose the following parametrization 

V 

J = ch LU , -■J = sh LU (2.30a) 
c 
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with 

-^thuj (2.306) 
c 

that connects the hyperbolic parameter uj with the standard velocity v. We 
now show the reason why the parametrization L{ijj) is very useful for the 
following developments. 

Let us consider two subsequent Lorentz transformations along the 
with hyperbolic parameters vj and ^. The total transformation is given by 
the following product of Lorentz transformations, that, by using the vector 
algebra notation, is written in the form 

[x'] = L{i^)L{i)[x] (2.31) 

The reader can calculate explicitly L{rj)L{^) by means of standard rules for 
row by column matrix product, then recalling 

sh{r] + ^) — sh rj ch ^ + sh ^ ch rj 

ch{r] + ^) = ch T] ch ^ + sh ^ sh T] 

one finds 

L{v)m=Liv + (2.32) 

Note that the previous result strictly depends on the chosen hyperbolic 
parametrization. Due to the relativistic nonlinear composition of veloci- 
ties, considering two subsequent Lorentz transformations, with v/c = th rj 
and w/c — th ^, one has, in contrast to eq.(2.32), 

L{v)L{w) L{v + w) (2.33) 

On the other hand, the composition of two Lorentz transformations with 
hyperbolic parametrization, as given in eq.(2.32), has the same form as the 
composition of two rotations around the same axis. 
Eq.(2.32) is the clue for the following development. 

By means of hyperbolic parametrization, we can now turn to express a finite 
Lorentz transformation in terms of the corresponding infinitesimal transfor- 
mation. 

Let us consider the case of small velocity, that is v/c << 1 or equivalently, 
for the hyperbolic parameter, a; ~ (see eq.(2.30b)). 
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In particular, at first order in a; or in v/c, one has 

V 

c/ia;~l, sh (jj 0:^ uj c:^ - 

c 

and, in consequence 

L{uj) ~ 1 + uK^ (2.34) 

where 1 and respectively represent the identity matrix and the generator 
of the Lorentz transformation matrix along the axis. This second term is 
usually called boost generator. 

Explictly, the matrix is easily obtained considering eqs.(2.1), (2.31a) and 
the above Taylor expansions of the hyperbolic functions. It has the form 



0-1 

-10 







'a' 




(2.35) 



The second expression in the previous equation is given for pedagogical rea- 
sons, that is to familiarize the reader with block matrices. 
In fact, the 4x4 matrix is written as a block matrix, in which each block 
is represented by a 2 x 2 matrix. In particular the upper left block is the 
Pauli matrix cr^, in the other "0" blocks the four entries of each block are all 
vanishing. The properties of the Pauli matrices are studied in the Appendix. 
For their definition see eq.(A.l). 
The reader should note the following two points: 

(i) there is no direct connection between of the previous equations and 
the quantum mechanical spin operator, 

(ii) as for the row by column product of a block matrix, the same rules of 
standard matrices must be used. 

In order to reconstuct the finite boost L(a;), (a; finite), we apply times, 

with A^ — > oo, the infinitesimal transformation of eq.(2.34). 

The linear boost composition law of eq.(2.32) allows to derive the following 

equation 



L(uj) = lim 1 + —K^ = expicuK^ 



l + {chu} -I) 



1 




+ sh ujK^ 



(2.36a) 
(2.366) 



17 



The second equality of eq.(2.36a)is obtained by comparing the series expan- 
sion in powers of u of the exponential, with ( 1 + ^K'^j for — > oo . 
Eq. (2.36b) is derived working on that series expansion. One has the following 
rules for the powers of 

{K^f = 1 , {K^f' = 

that, for example, can be derived from the corresponding properties of cr^ by 
means of eq.(A.4). 

The coefficients that multiply [K^Y'^ and [K'^Y'^^^ can be summed up, giv- 
ing ch uj — 1 and sh uj, respectively. One can straightforwardly check that 
eq. (2.36b) is equal to eq.(2.1) with the hyperbolic parametrization of eq. (2.30a) 
We have developed in some detail this example as a guide to construct the 
finite boost transformations for Dirac spinors in eqs.(3.17a,b). 

We remind the reader that all the relevant properties of the boost transfor- 
mation are contained in the infinitesimal form given in eq.(2.34) with the 
matrix boost generator of eq.(2.35). The finite expression of the boost is 
obtained by means of a standard mathematical procedure that does not add 
new physical information. 

2.4 Lorentz Transformations in an Arbitrary Direction 

In the previous developments we have considered Lorentz transformations 
along the x^-axis. The transformations along the x'^- and x^-axis are directly 
obtained interchanging the spatial variables. In this way (as done in ref.[4]) 
one obtains a sufficiently general treatment of relativistic problems. For 
completeness and to help the reader with the analysis of some textbooks (as 
for example ref.[2]) and research articles, we now study Lorentz transforma- 
tions with an arbitrary boost velocity v direction. The comprehension of the 
other Sections of this work docs not depend on this point. In consequence, 
the reader (if not interested) can go directly to eq.(2.41). 
For definiteness we consider the time-space four- vector x^ — (x°, r), but the 
results hold for any four- vector. 

The transformation equation (2.1) can be generalized in the following way 

TV 

x' = 7(a;° - — ) (2.37a) 



1 U 




(^l)2n+l _ ^1 
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r'v = 7(--^° + rv) (2.376) 

v\ = (2.37c) 

where the unit vector v has been introduced so that v = vv with ?; > and 
the notation denotes the spatial components of r perpendicular to v. 
Eq. (2.37a) directly represents the Lorentz transformation of the time com- 
ponent of a four- vector for an arbitrary direction of the boost velocity. 
Some handling is necessary for the spatial components of the four-vector. 
Starting from eqs.(2.37b,c) we now develop the transformation for r. One 
can parametrize this transformation according to the following hypothesis 

r' ^r + f{v)\{rY)Y + g{v)-x\ (2.38) 

Note that it correctly reduces to the identity when v = and automatically 
gives eq. (2.37c) for the perpendicular components of the four- vector. 
Multiplying the previous equation by v and comparing with eq(2.37b) one 
finds 



g{v) = -7 (2.39a) 

where the last expression of eq. (2.39b) is obtained by using the standard 
definition of the factor 7. 

Analogously to eq. (2.36a), the Lorentz transformation for the four- vector 
given by eqs. (2.37a) and (2.38) can be written in exponential form, as 

L(u;v) = exp(a;vK) (2.40) 

with the same connection between uj and v as in eq.(2.30b). The matrices of 
the boost generator K = {K^, K'^, K^) are defined as 



" 0- 


1 





" 




' 


-1 








































-1 











0. 











- 1 


0" 




' 





-1 














































0. 




. -1 






(2.41) 
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Note that had been aheady derived in eq.(2.35). Furthermore, and 
can be directly obtained performing the Lorentz transformation analogously 
to eq.(2.1) but along the x^- and a;^-axis and repeating the procedure that 
leads to eq.(2.35). 



2.5 The Commutation Rules of the Boost Generators 

The most important property of the boost generators or, more precisely, of 
the matrices K given in eq.(2.41), is represented by their commutation rules. 
Let us consider an illustrative example. For generality, we shall denote the 
Lorentz transformation as boost, using the symbol B. 

In a first step, we perform a boost along x'^ with a small velocity. At first 
order in the hyperbolic parameter a;^ one has 

B^ ~ l+co^K^ 

Analogously, in a second step, we make a boost along x^, with hyperbolic 
parameter uj'^, that is 

B^ ~ 

The total boost, up to order uj^uj'^, is 

^12 ^ ^1^2 _ ^ ^ ^1^1 ^ ^2^2 ^ ^ V^ii^2 ^2.42a) 

Note the important property that the product of two boosts is a Lorentz 

boost because it satisfies eq.(2.7), as it can be directly verified. 

We now repeat the previous procedure inverting the order of the two boosts, 

obtaining 

What is the difference between the two procedures ? Subtraction of eqs.(2.42a,b) 
gives 

B^^-B^^c^uW[K\K^] (2.43) 

where the standard notation for the commutator of the matrices and K'^ 
has been introduced. Explicit calculation gives 




10 
0-100 




(2.44) 
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At this point two (connected) questions are in order. What is the meaning 

of the noncommutativity of the boost generators? Which physical quantity 

is represented by the commutator of the last equation? 

To answer these questions it is necessary to recall some properties of the 

rotations. 

They are initially defined in the three dimensional space. Let us rotate the 
vector r counterclockwise, around the axis, of the angle d^. For a small 
angle, at first order in 6*^, one obtains the rotated vector 

r' ~ r + 6l^k X r (2.45a) 

where k represents the unit vector of the x"^ axis. One can put r and r' in the 
three component column vectors [r] and [r'] so that the previous equation 
can be written with the vector algebra notation as 

[r'] ~ (l + e^s^) [r] (2.456) 

where (see the next equation) represents the three-dimensional generator 
matrix of the rotations around the axis x^. The same procedure can be 
repeated for the rotations around the axes x^ and x^. The generator matrices 
are 

-1 0" 

1 


(2.46) 

As it is well known the (previous) rotation generator matrices do not com- 
mute: 

[s",/] = e"''V (2.47) 

where we have introduced the Levi-Civita antisymmetric tensor e"^^ . 
As for the noncommutativity^ this situation is partially similar to the case 
of the boost generators shown in eq.(2.43) but, for the rotations, eq.(2.47) 
shows that, given two generator matrices, their commutator is proportional 
to the third matrix, while we have not yet identified the physical meaning of 
the matrix in the r.h.s. of eq.(2.44). 

Pay attention ! In quantum mechanics, from eqs.(2.46), (2.47) we can intro- 
duce the spin 1 operators as 
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satisfying the standard angular momentum commutation rules. Our s"' do 
not directly represent the three spin operators. 

It is very important to note that the physical laws must be invariant under 
rotations. To make physics we assume that space is isotropic. As the hy- 
pothesis of an absolute reference frame must be refused, in the same way the 
idea of a preferencial direction in the space is not allowed by the conceptual 
foundations of physics. 

Obviously, rotational invariance must be compatible with relativity. This 
fact is immediately evident recalling that the rotations mix the spatial com- 
ponents of a vector without changing the scalar product of two three- vectors, 
say a and b: a'b' = ab. 

The time components of the corresponding four-vectors also remain unal- 
tered: a' ° = a° and 6' ° = 6°. Consider, as two relevant examples, the time 
and energy that represent the zero components of the position and momen- 
tum four-vectors, respectively. 

It means that rotations satisfy the invariance equation (2.3) and, in conse- 
quence, they are fully compatible with relativity. In terms of 4 x 4 matrices, 
eq. (2.45b) is generalized as 

[x'] ~ (l + e^S^) [x] (2.48) 
The 4x4 generator matrices are defined in terms of 3 x 3 s° as 





■ 


0" 


















. 





As it will be written in eq. (2.50b), the matrices S" obviously satisfy the same 
commutation rules of eq.(2.47). 

We can now verify that the r.h.s. of eq.(2.44) represents — S^. 
In general one has the following commutation rules 

For completeness, we also give 

S(^] = e"^''^'' (2.506) 



(2.50a) 
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and 

[5", K^] = e'^^^K^ (2.50c) 

where the last equation means that the boost generator K transforms as a 
vector under rotations. 

As for the derivation of the Dirac equation that wiU be performed in the next 
section, we anticipate here that a set of K"' and S"' matrices (different from 
eqs.(2.41) and (2. 46), (2. 49)) will be found, that satisfy the same commuta- 
tion rules of eqs.(2.50a-c). In mathematical terms, these new matrices are a 
different represenf.ation of the Lorcntz group, allowing to satisfy in this way 
the relativistic invariance of the theory. 

For the study of the Dirac equation, it is also necessary to introduce another 
invariance property related to a new, discrete, space-time transformation. It 
is the parity transformation, or spatial inversion, that changes the position 
three- vector r into — r, leaving the time component unaltered. This definition 
shows that spatial inversion does not change the invariant product of two 
four-vectors and, in consequence, is compatible with relativity. 
Parity is a discrete transformation that does not depend on any parameter. 
On the other hand, recall that rotations are continous tranformations, that 
continously depend on the rotation angle. Obviously, spatial inversion cannot 
be accomplished by means of rotations. 

Note that, under parity transformation, ordinary, or polar vectors, as for 
example the momentum p, do change sign in the same way as the position 
r, while the axial vectors, as for example the orbital angular momentum 
1 = r X p, do not change sign. On the other hand they transform in standard 
way under rotations. 

Using the definition given above, parity transformation on the space-time 
position, 

[x'] - n[x] 

is accomplished by means of the diagonal Minkowsky matrix. We can write 
that holds for the spatial inversion of all the four-vectors. 
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Prom the previous definition, one can easily verify the following anticommu- 
tation rule with the boost generators 



{U,K"} = 



(2.51a) 



or equivalently 



UK"U 



K' 



a 



(2.516) 



where we have used the standard property 
Furthermore 



= 1. 



[n, 5"] = 



(2.52a) 




(2.526) 



It shows that the rotation generators do not change sign under spatial inver- 
sion, that is they behave as an axial-vector. 

The determinant of Lorentz boost and rotations is equal to -|-1, while for 
spatial inversion it is —1. 

Note that eqs.(2.51a)-(2.52b) represent general properties of the parity trans- 
formation that do not depend on the tensor to which it is applied. They are 
derived, and hold, in the case of four- vectors, but they are also assumed to 
hold for the Dirac spinors. But, in this case, the following critical discussion 
is necessary. 

In fact, after these formal developments, we can ask: being parity compatible 
with relativity, are the physical laws of nature really invariant under spatial 
inversion? 

The situation is different with respect to rotations, that represent a necessary 
invariance for our understanding of nature. 

Initially, parity was considered an invariance of physics, but in the fifties 
the situation changed. In fact, some experiments on beta decay showed that 

weak interactions are not invariant under spatial inversion. On the other 
hand, gravitational, electromagnetic and strong ( or nuclear) interactions are 
parity invariant. 

When deriving the Dirac equation, we shall require the fulfillment of parity 
invariance, having in mind the study of electromagnetic and strong interac- 
tions. In a following work we shall discuss the weakly- interacting neutrino 
equations, that are not invariant under parity transformation. 
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We conclude this section mentioning another discrete transformation, called 
time reversal, that consists in changing the sign of time: t' = —t. Classical 
laws of physics are invariant with respect to this change of the sense of 
direction of time. The action of time reversal on the space- time four vector 
is represented by the matrix T — —11 = —g. 

At microscopic level, time reversal invariance is exact in strong and electro- 
magnetic processes, but not in weak interactions. However, this violation is 
of different kind with respect to that of parity transformation. 
We conclude pointing out that in the formalism of field theories the product of 
the three transformations : C {Charge Conjugation), P {Parity) and T {Time 
Reversal), is an exact invariance, as confirmed by the available experimental 
data. 



3 Relativist ic Quantum Wave Equations 



In this Section we shall study the procedure to implement the principles of 
special relativity in the formalism of quantum mechanics in order to introduce 

the fundamental Dirac equation. 

Previously, in Subsection 3.1 we shall analyze the general properties of the 
four-momentum operator in quantum mechanics and discuss at pedagogical 
level the Klein-Gordon equation for spinless particles. 

3.1 Generalities and Spin Equation 

Let us firstly recall the Schrodinger equation for a free particle. In the coor- 
dinate representation it has the form 



It can be obtained by means of the following eqs.(3.2a-c), performing the 
translation, in terms of differential operators acting onto the wave function 
'il){t, r), of the standard nonrelativistic expression 




(3.1) 
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It clearly shows that Schrodinger equation (3.1) is essentially nonrelativistic 
or, in other words, not compatible with Lorentz transformations. 



As discussed in refs.[5,6], the fundamental relation that is used for the study 
of (relativistic) quantum mechanics associates the four-momentum of a par- 
ticle to a space-time differential operator in the following form 

= ihd^ (3.2a) 

that, as explained in Subsection 2.1, means 

p^c^E^ ih^ (3.26) 

and 

d 

p = -ihV = -ih— (3.2c) 
ar 

The reader may be surprised that at relativistic level the same relations hold 
as in nonrelativistic quantum mechanics. As a matter of fact, eqs.(3.2a-c) 
express experimental general properties of quantum waves, as given by the 
De Broglie hypothesis. 

Furthermore, the connection with relativity is possible because ihd'^ is a 
contravariant four-vector operator. 

The easiest choice to write a relativistic wave equation consists in translating 
eq.(2.16) {instead of the nonrelativistic expression !) in terms of the space- 
time differential operators given by the previous equations. One has 

-n^d^d^ilj{t, r) = (mc) V(^, r) (3.3a) 

or, more explicitly, multiplying by 

-M' (^^ - V'l^, r) = m'c'^it, r) (3.36) 

Exactly as done for the electromagnetic field equations in Subsection 2.3, 
recalling the invariance of dfj,d^, one realizes that previous equation is mani- 
festly covariant. 



In order to make explicit calculations in atomic, nuclear and subnuclear 
physics, it is necessary to remember some numerical values (and the corre- 
sponding units !). We start considering the following quantities that appear 
in eq.(3.3b): 

he = 197.327 MeV fm 
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that is the Planck constant fi multiphed by the speed of hght c, expressed as 
an energy multiphed by a length. The energy is measured in MeV 

IMeV = lO^eV = 1.6022 x 10^^ Joule 

and the length in fm (femtometers or Fermis) 

Ifm = 10~^^m = IQ-^^cm 

Furthermore, the particle masses are conveniently expressed in terms of their 
rest energies. We give a few relevant examples 

ruec^ = 0.511 MeV 

for the electron 

mpC^ = 938.27 MeV 

for the proton, and 

m„c^ = 939.57 MeV 

for the neutron. 

Also note that the operator d'^ is, dimensionally, a length'^, that in our units 
gives fm~^. 

Going back to the formal aspects of eq.(3.3a,b), usually called Klein-Gordon 
equation, we note the two following aspects: 

(i) Being based on the relativistic relation among energy, momentum and 
mass of eq.(2.16) with the Dc Broglie hypothesis of cqs.(3.2a-c), the mani- 
festly covariant Klein-Gordon equation has a general validity, in the sense 
that the wave fuctions of all the relativistic free particles must satisfy that 
equation. As for the Dirac equation for spin 1/2 particles, see eq.(3.41) and 
the following discussion. 

(ii) In the Klein-Gordon equation it does not appear the particle spin. Or, 
equivalently, the function ip{t, r) is a one-component or scalar field function 
that describes a spin particle, as it happens in nonrelativistic quantum 
mechanics when spin is not included. 

The effects of rotations and Lorcntz boosts only consist in the argument re- 
expression discussed in Subsection 2.2. As explained in textbooks of quantum 
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mechanics, see for example ref.[7], the (infintesimal) rotations are performed 
by using the orbital angular momentum operator as generator. 



The Klein-Gordon equation admits plane wave solutions, corresponding to 
eigenstates of the four-momentum p^^ = (-f , p) in the form 



■0p(i, r) = Nexp 



h 



{-Et + pr) 



(3.4a) 



= A^exp ( -^PfiX^ 



N exp 



n 



[pfg[x] 



(3.46) 



where represents a normalization constant. The expression (3.4b) has 
been written using explicitly the Lorentz covariant notation. 
The most relevant point here is that the energy eigenvalue E can assume 
both positive and negative values (we shall see that it holds true also for 
Dirac equation !) We have 



E 



Ae(p) 



where 



[rac 



,2\2]l/2 



(3.5a) 



(3.56) 



and the energy sign \ = + / — 1 have been introduced. 

In quantum mechanics the A = — 1 solutions cannot be eliminated. They are 
strictly necessary to have a complete set of solutions of the wave equation. 
They can be correctly interpreted by means of charge conjugation in the 
framework of field theory, as done in most textbooks. Historically, starting 
from the work by Dirac, negative energy solutions lead to the very important 
discovery of the antiparticles, that have the same mass (and spin) but opposite 
charge with respect to the corresponding particles. 

We shall not analyze this problem here but postpone it to a subsequent work. 



As for the positive energy solutions, one can immediately check that in the 
nonrelativistic regime (|p|c << mc^) the Schrodinger limit is obtained. 

As an illustrative exercise, it may be useful to perform a Lorentz boost in 
eq.(3.4b). Given that we are considering a scalar field, we have to make only 
the argument re- expression. 
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In this concern recall that, for positive energy, the wave function of eq.(3.4a,b) 
represents a particle state such that an observer in S measures the particle 
four- momentum p^. 

In the reference frame S', for the space-time position one must use 

[x] =L-^[x'] 

(both the velocity and the hyperbolic parametrizations can be adopted and, 
for simplicity, no argument has been written in L^^) and replace it in eq.(3.4b). 
In the argument of plane wave exponential one has 

where we have used gL~^ — Lg from eq.(2.10) and also [pf^L — [p']^. In 
the previous result we recognize the invariance equation that, in standard 
notation, reads 

Physically, it means that an observer in S' measures the particle transformed 
four- momentum p'^. 

The Klein-Gordon equation admits a conserved current. We shall consider 
the form, related to a transition process, that is used in perturbation theory 
to calculate the corresponding probability amplitude. 
To derive the conserved current one has to make the following three steps. 

(i) Take eq.(3.3b) with a plane wave solution '4>pj{t,r) for an initial state of 
four- momentum p/. 

(ii) Take eq.(3.3b) with a plane wave solution il!pp.{t,r) for a final state of 
four- momentum pp and make the complex conjugate. 

(iii) Multiply the equation of step (i) by the complex conjugate ipp^it, r) and 
the equation of step (ii) by ippj{t,r). Then subtract these two equations, 
obtaining 

[d'^d^rpAt,v)]^p^{t,v)-rpAt,r)d,d'^^p,{t,r) = (3.6) 

Note that the mass term has disappeared. The previous equation can be 
equivalently written as a conservation equation in the form 

dMt,r)^0 (3.7) 
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where the conserved current is defined as (multiplying by the conventional 
factor ih) 



J^,{t, r) = th[r,,{t, r)9^^,,(t, r) - (^''^^ (t, r))^,,(t, r)] (3.8a) 



In the last equation the four-momentum transfer q'^ = Pp — of the transi- 
tion process has been introduced. 

The conserved current Jpiit, r) is manifestly a four- vector. 

The latter eq.(3.8b), that is obtained by explicit use of the wave functions, is 
very interesting. The first term (p/ +p^) represents the so-called four- vector 
vertex factor. 

Applying to eq.(3.8b) the derivative operator one verifies that current 
conservation relies on the following kinematic property of the vertex factor 



that is automatically satisfied because the mass of the particle remains the 
same in the initial and final state. 

As for the general properties of the current given in eqs.(3.8a,b) we find that 
in the static case, i.e. pp — Pi, the time component is negative if negative 
energy states (A = —1) are considered. It means that one cannot attach to 
J^j the meaning of probability density as it was done with the Schrodinger 
equation. For this reason we do not discuss in more detail the plane wave 
normalization constant A'^. 

Again, a complete interpretation of the Klein-Gordon equation and of its 
current is obtained in the context of field theory. 



In nonrelativistic quantum- mechanics a spin 1/2 particle is described by a 
two-component spinor (j). The spinor rotation is performed by mixing its 
components. At first order in the rotation angle one has 




(3.86) 



Quip'i + Pf) = Pf'^PFu - Pi^Pin = 



(3.9) 



3.2 Spin 1/2 Dirac Equation 




(3.10) 
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where the three Pauh matrices have been introduced. Their properties 

are studied in the Appendix. 

What is important to note here is that the matrix operators S^^ = —\(y'^ 
play the same role in realizing the rotations as the matrices 5*" defined in 
eq.(2.49). For this reason, their commutation rules are the same as those 
given in eq. (2.50b). Also, the s-pin or intrinsic angular momentum operator 
is defined [7] multiplying hj h/2 the Pauli matrices o"". 

Formally, we have introduced the two-dimensional representation of the ro- 
tation group ( the three-dimensional representation corresponds to spin 1, 
etc.). 

Finally, the spatial argument of the spinor (j) (not written expicitly in eq.(3.10)) 
is rotated with the same rules previously discussed for the arguments of the 
field functions, that is one has to perform the argument re- expression. 
In quantum mechanics, the generator of these rotations is the orbital angu- 
lar momentum operator 1 = r x p , so that the total angular momentum is 
given by the three generators of the total rotation (on the spinor and on the 
argument), in the form 

We can now try to introduce relativity. We shall follow a strategy similar to 
that of refs.[5,8], but avoiding many unessential (at this level) mathematical 
details. 

First, we note that for a particle at rest, the relativistic theory must coincide 
with the previous nonrelativistic treatment. 

Second, we make the following question: can we find a set of three 2x2 

boost matrices (acting on the two-component spinors) that satisfy, with the 
S*^] = — fo"" replacing the 5", the same commutation rules as the K"' in 
eqs.(2.50a-c)? 

The answer is yes. A simple inspection of eqs.(2.50a-c) and use of the stan- 
dard property of the Pauli matrices given in eq.(A.2) show that the matrices 
7^j2] = f '^'^ satisfy those commutation rules. 

Eq. (2.50a) rcqinres = 1, while eq. (2.50c) docs not give any new constraint 
on the parameter r, that, in consequence can be chosen equivalently as 
T = +/-!. 

However, a serious problem arises when trying to introduce the parity trans- 
formation matrix. It must satisfy, both the anticommutation rule with the 



31 



boost generators as in cq. (2.51a) and the commutation rule with the rotation 

generators as in cq. (2.52a). In our 2x2 case, boost generators and rotation 
generators arc proportional to the Pauli matrices, so there is no matrix that 
satisfies at the same time the two rules [8]. 

In consequence, we can construct a two-dimensional theory for spin 1/2 par- 
ticles that is invariant under Lorentz transformations but not under parity 

transformations. 

On the other hand, the first objective that we want to reach is the study 
of the electromagnetic interactions of the electrons in atomic physics and in 
scattering processes. To this aim we need an equation that is invariant under 
spatial inversion. 

A parity noninvariant equation for spin 1/2 particles, based on the transfor- 
mation properties outlined above, will be used for the study of the neutrinos 
that are created, destroyed and in general interact only by means of weak 
interactions that are not invariant under spatial inversions. 

In order to construct a set of matrices for spin 1/2 particles satisfying both 
Lorentz and parity commutation rules, we make the two following steps: 

(i) we consider matrices with larger dimension; 

(ii) we exploit the sign ambiguity of r in the boost generator. 

More precisely, it is sufficient to introduce the following 4x4 block matrices 





-a^ 



(3.11) 



where we have taken t — and r = — 1 in the upper and lower diagonal 
block, respectively. 

Important note: the previous equation represents the definition of the three 
matrices . We use the greek letter 5 (instead of a) as spatial index to avoid 
confusion between the indices and the matrices. 



With no difficulty, for the spinor rotations we introduce 





so that 



a 



(3.12a) 
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Note that, taking into account the discussion for the transformation of the 
two-dimensional spinors with 5*^] and K^2]j the commutation rules of eqs. (2.50a- 
c) for Lorentz transformations and rotations are automatically satified by the 
block diagonal matrices K^d]^ ^[d] introduced above. 

For the spatial inversion, we find the 4x4 block matrix 

= (3 (3.13) 
with the property 

n[D] = nU = ^[D] (3-14) 

It satisfies the anticommutation with the boost generators of eq.(2.51a,b), 
that means 

{np],Xf^]} = {/3,a^} = (3.15) 

The specific form of n[D] straightforwardly satisfies also the rules (2.52a,b). 
In technical words, we have obtained a representation of the Lorentz group, 
including parity, for spin 1/2 particles. 

Introducing explicitly the four component Dirac spinor u, its boost transfor- 
mation is written in the form 

u'^BiD]{u})u (3.16) 
The (infinitesimal) form of -B[£i](cj) at first order in cu is 

B[D]{u;) 1 - ^ u; {air) (3.17a) 

where, as usual, v represents the unity vector of the boost velocity. (For 
simplicity, we do not write it explicitly in Bp] (a;).) 

The finite transformation is obtained in the same way as in eqs.(2.36a,b) and 
(2.40) but using the properties of the Pauh matrices, as it is shown in detail 
in eqs.(A.17),(A.18) and in the following discussion in the Appendix. One 
has 

£p](a;) = exp[-|(av)] = c/i(|) - iav)sh{^) (3.176) 

On the other hand, the spinor rotations are obtained by replacing the 
with the 4x4 matrices in eq.(3.10). 



n 



1 

1 
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Furthermore, when changing the reference frame, one has always to perform 

the argument re- expression in the Dirac spinors u. 

We note that, while the rotations are represented by a unitary operator, the 
Lorentz boost are not. More precisely, i^p] (a;) is a antiunitary operator, that 
is 

Sf^jM^SpiM (3.18) 

A unitary, but infinite dimensional (or nonlocal) representation of the boost 
for spin 1/2 particles can be obtained. This problem will be studied in a 
different work. 

The next task is to construct matrix elements (in the sense of vector algebra 
and not of quantum mechanics, because no spatial integration is performed) 
of the form Ub'Mua , that, when boosting Ua and Ub, transform as Lorentz 
scalar and Lorentz four- vectors. The case of pseudoscalars and axial-vectors 
will be studied in Subsection 3.4. 

We shall keep using the word matrix elements throughout this work, but 
in most textbooks they are commonly denoted as Dirac covariant bilinear 
quantities. 

Given a generic 4x4 matrix M, by means of eq.(3.17a) the transformation 
of the matrix element up to first order in a;, is 

u'.^Mu'^ ~ uMua v^Ub^{a\ M}ua (3.19) 

The Lorentz scalar matriz element is easily determined by means of a matrix 
Ms that anticommutes with the so that the second term in the r.h.s. of 
eq.(3.19) is vanishing. Simply recalling eqs.(3.15) and (3.13) one has 

(3.20) 

where we are using the definition of the /3 Dirac matrix given in eq.(3.13). 

As for the four- vector matrix element, one needs four matrices . To find 
their form in a simple way, let us consider a boost along the x^-axis, that 
in eq.(3.19) means v = (1,0,0). By means of eq.(3.19), to recover the four- 
vector Lorentz transformation (see eqs.(2.1) and (2.34)), one needs 

^{a\M°} = M,i (3.21) 



1 

1 
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for the transformation of M^, and 

^{a\M'J = M'^ (3.22) 

for the transformation of . 

The solution is easily found calculating the anticommutators of the Dirac ma- 
trices by means of the anticommutators of the Pauli matrices of eq.(A.3). 
One has 

M° = a° = 1, = (3.23a) 

and the solution for all the components is 

= = (1, a\ a^, o?) (3.236) 

Pay attention: = 1 is not introduced in most textbooks. 

We can resume the previous equations, also for finite Lorentz boosts, as 

S[o](u;)/3Sp](u;)=/3 (3.24a) 

or, equivalently 

B^n\{^)i5 = i5B^^^{i^) (3.246) 
for the scalar matrix elements, and 

S[o](a;)«'^Sp](a;) = UM^^" (3-25) 

for the four- vector ones. 

The previous developments, recalling the expression of the four-momentum 
operator given in eqs.(3.2a-c), allow to write a linear covariant wave equation 
in the form 

ihc d^a^i/^ix) = mc^ /3V(^) (3.26) 

that is the Dirac equation, where m is the particle mass and ip{x) — ip{t,r) 
is a four component Dirac spinor representing the particle wave function. 
Intuitively, the covariance of the Dirac equation can be proven multiplying 
the previous equation from the left by a generic hermitic conjugate Dirac 
spinor. In the l.h.s. one has a Lorentz scalar given by the product of the 
(contravariant) four- vector martix element of with the (covariant) oper- 
ator ihc . In the r.h.s. one has the Lorentz scalar directly given by the 
matrix element of j3. 
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More formally, we can prove the covariance of the Dirac equation in the 
following way. We write the same equation in S' and show that is equivalent 
to the (original) equation in S. We have 

ihc d = rnc'^ f^^'i^') (3.27) 

The spinor in S' is related to the spinor in S by means of eq.(3.16): 

7jj'{x')^B[n]{u;)^{x\x)) (3.28) 

We replace the last expression in eq.(3.27) and multiply from the left that 
equation by 5[£)](a;). In the r.h.s., by means of eq.(3.24a) one directly obtains 
Pip. In the l.h.s., one has to consider eq.(3.25), transforming the equation in 
the form 

ihc d '^L''^{uj)a''ip{x'{x)) = mc^ PiIj{x\x)) 
We can use the more synthetic vector algebra notation, writing 

d '^L^^a'' = [d 'fgL[a] = [dfg[a] = d,a^ (3.29) 

where in the second equality we have used gL = L^^g. 

In this way we have shown the equivalence of eq.(3.27), written in S', with 

the original equation (3.26), written in S. 

3.3 The Gamma Dirac Matrices and the Standard Rep- 
resentation 

The physical content of the Dirac equation is completely contained in eq.(3.26) 
and in the related transformation properties. However, to work in a more 
direct way with Dirac equation and its applications, some more developments 

are necessary. 

First, we introduce the Dirac adjoint spinor that is preferably used (instead 
of the hermitic conjugate) to calculate matrix elements. It is defined as 

u = u^'P (3.30) 

Its transformation law is straightforwardly obtained in the form 

u' = u'^p = u^BiD]{u;)p = uBTj^Au) (3.31) 
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where eq. (3.24b) has been used. As it must be for a representation of the 
Lorentz boost, B^j^^{u!) is obtained inverting the direction of the boost veloc- 
ity 



= + (av)s/i(|) ~ 1 + i a; (av) 



(3.32) 



As an exercise, the reader can check that B[r)]{oj)B^^^{uj) = 1 by using the 
properties of the matrices. 

Note that in the previous results there is no new physical content. We can 
represent the Lorentz scalar {invariant) as 

ulPUa = UbUa (3.33) 

In fact, we have learned in eq.(3.31) that u transforms with B^j^-^{uj). 
We introduce the Dirac matrices 7^ defined as 



RecaUing that = 1, one has 

The four- vector matrix element can be written as 

and the Dirac equation (3.26) takes the usual form 

ihc d^'-y^ip^x) = mc^ ip{x) 
For clarity we give the explicit expression of the 7^ 



"0 


r 




■ 




.1 


0. 










(3.34a) 



(3.346) 



(3.35) 



(3.36) 



(3.37) 



As it will be discussed in the following, this is the so-called spinorial repre- 
sentation of the Dirac matrices. 

Starting from the anticommuation rules of the a'^ one finds the following 
fundamental ! anticommutation rules of the 7^* 



(3.38) 
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Furthermore, one easily verifies that 7° is hermitic while the 7*^ are anti- 
hermitic: 

-i^"^ = g^^-f" = 7%^7° (3.39) 

Note that in g^^ the index not summed; the last equality is obtained by 
standard use of eq.(3.38). Furthermore, the previous equation also holds in 
the standard representation of the Dirac matrices that will be introduced in 
the following. 

We can now easily examine the usual procedure that is adopted to introduce 
the Dirac equation. Consider, for example, refs.[6,9]. The differential wave 
equation for a spin 1/2 particle is assumed to be linear with respect to the 
four-momentum operator introduced in eq.(3.2a.c) and to the particle mass. 
According to this hypothesis, the equation is written as 

ihc df.V'tpix) = mc? i\){x) (3.40) 

where the are four adimensional matrices to be determined. 

Then, one multiplies by iUc d^X^ and, by using the same eq.(3.40), obtains 

in the l.h.s. another factor mc? . The equation takes the form 

-(TicfdJ^d^Y^^ix) = {mc^f ij{x) (3.41) 

As we said in Subsection 3.1, the wave function of any relativistic parti- 
cle must satisfy the Klein-Gordon equation (3.3a,b). This property must 
be verified also in our case. To this aim, we make the following algebraic 
manipulation 

^.F'^a^F^ = ^ a^a,(FT'' + F'^F'^) 

It shows that the F^ must satisfy the anticommutation rules of eq.(3.38). 
The lowest dimension for which it is possible is 4 and we can identify the 
F'* with the 7^* of eq.(3.37) that have been derived by means of relativistic 
transformation properties. 

In any case, (we repeat) the previous development is useful to show that the 
solutions of the Dirac equation are also solutions of the Klein-Gordon one. 
We can expect that also Dirac equation admits negative energy solutions. 

We now face a different problem. In Subsection 3.2 we have seen that the 
relevant point for the covariance of the Dirac equation is represented by the 
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anticommutation rules of the and /3 matrices. The same is true for the 
7^*. In other words, their specific form is not important, provided that the 
anticommutation rules are fulfilled. We now look for another representa- 
tion, different from eq.(3.37), and more useful for practical calculations. We 
construct this new representation starting from a specific solution of Dirac 
equation (3.26) or (3.36). 

Let us consider a particle at rest, that is, in a three-momentum eigenstate 
with p = 0. The spatial components ^ of the derivative operator, when 
applied to the corresponding wave function, give zero. The Dirac equation 
reduces to 



d'i/j{x) 
dt 



mc 



1 

1 



(3.42a) 



We can split the Dirac spinor into two two-component spinors 



So that eq. (3.42a) is written as a system of coupled equations: 

in— = mc E 
dt ^ 



ih 



dt 



(3.426) 



We can sum and subtract these two equations introducing the new two- 
component spinors 



X 



(3.43) 



(the factor guarantees that normalization of the new Dirac spinor is not 
changed). One finds 



dip 



-m(?x 



(3.44) 



These equations are equivalent to eq. (3.42b) but they are decoupled. Techni- 
cally, we have diagonalized the r.h.s. rest frame Hamiltonian of eq. (3.42a). 
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The solutions are easily found: 







with positive energy E = +mc^ 



with negative energy E — —mc^. The presence of two energy values repre- 
sents a general property of relativistic wave equations. 

The advantage of the solutions of eq.(3.44) is that only one two- 

component spinor is nonvanishing while the other is zero. In the positive 
energy case, the nonvanishing spinor can be identified with the nonrelativis- 
tic one. Furthermore, when considering a positive enery particle with small 
(nonrelativistic) velocity, we can expect the lower components of ■0+ to be 
(not zero but) small with respect to the upper ones. 

For these reasons we apply that transformation to a generic Dirac spinor, 
not only in the case p = 0. 

More formally, we perform the transformation of eq.(3.43) by introducing the 
following matrix 

1 r 



u = ^ 



1 

x72 



(3.45) 



that satisfies 



C/^ = u- 



u 



We multiply from the left the Dirac equation (3.36) by U and insert UU = 1 
between the 7'^ and ip . In this way we transform the Dirac wave function 
and, at the same time, the Dirac matrices obtaining 



(3.46) 



where the 7^^ are the Dirac matrices in the standard representation, while 
the 7^^ of eq.(3.37) have been given in the so-called spinorial representation. 
In most physical problem (specially if a connection with nonrelativistic physics 
is wanted) the standard representation is adopted. Generally the index "st" 
is not explicitly written. In the following of the present work we shall also 
adopt this convention. 
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Note that, due to the property of U given above, if two matrices in the 
spinorial representation satisfy an (anti) commutation rule, the corresponding 
matrices in the standard representation also satify the same rule. 

In particular, this property holds for the anticommutation rule of eq.(3.38) 
of the 7'*. In the standard representation they have the form 



" 1 0' 




■ a^' 


.0 -1. 




_-a^ . 



(3.47) 



The hermitic conjugate satify the same eq.(3.39). As for the a'*, by using 
eq. (3.34b), one has 



a^, = Ua^U = UI3UUYU = 7°t7.^* 
Explicitly, without writing the index "st", they are 



"1 


0' 




" a^- 


.0 


1. 







(3.48) 



(3.49) 



Note that the spin E"^ matrices of eq.(3.12a) keep the same form in the 
spinorial and standard representation. 

In consequence, one can define K^^j] = ^a^ and S'pj = — f^'' by using the 
standard representation for the (and the T,^): the boost and rotation 
generators commutation rules are equivalently fulfilled. Furthermore, the 
expression of the boost operator is the same as in eq.(3.17a,b), with the 
written in the standard representation. 

3.4 Parity Transformations and the Matrix 7^ 

There is a fifth matrix that anticommutes with the other 7^*. It is 7^: 

{7^7'} = (3.50) 
In the spinorial and standard representations, one has, respectively 



7: 



sp 



"-1 


0' 




" -1" 





1. 


^5 _ 
' 1st 


.-1 0. 



(3.51) 



Note that 7^^ = 7^ and (7^)^ = 1. 
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Furthermore, we use the definition of ref. [5], but, as done in many texbooks, 
7^ can be defined mufiiplying eq.(3.51) by —1. All its properties remain 
unchanged. Pay attention to which definition is used ! 

To understand the physical meaning of the matrix elements of 7^, it is useful 
to go back to Dirac spinor parity transformation. As shown in eq.(3.13), 

this transformation is u' = j3u being = 7°. Let us consider the parity 
transformation for Lorentz scalar and four- vector matrix elements. Standard 
use of the 7^^ anticommutation rule (3.38) gives 

UbUa = U},Ua (3.52a) 

and 

Ubl^'ud = Ubi'^Ua (3.526) 

UbJ^Ua = -UbJ^Ua (3.52c) 

These results have an easy physical interpretation: a Lorentz scalar and a 
time component of a four- vector (for example a charge density) do not change 
sign under spatial inversion, while the spatial components of a four-vector 
(for example a current density) do change sign. 

Let us now consider the following matrix element 

Ub'y^Ua 

The Lorentz boost are studied by means of eq.(3.19) taking Mps = 7^7'^. 
Standard use of eqs.(3.50) (3.38) and (3.34a,b) show that 

{a^7V} = (3.53) 

so that we can conclude that our matrix element is invariant under Lorentz 
transformations. The same can be shown for rotations using the generator 
of eq.(3.12a,b). 

But, what happens with spatial inversion ? We have 

Ub'^^Ua' = Ubl^l^l^uJ = -UbUa (3.54) 

It means that our matrix element changes sign under parity transformation. 
It is a pseudo-scalar quantity. 
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In terms of elementary quantities, a pseudo-scalar is given by the product 
of an axial vector (see the discussion of subsect 2.5) with a standard vector, 
for example the spin with the three-momentum: sp. (It is not possible to 
use the orbital angular momentum instead of spin because one has Ip = 0, 
identically) . 

We now consider the following matrix element 

Standard handling (that is left as an exercise) with the 7'^ and 7^ shows 
that, under Lorentz boosts and rotations, it transforms as a four-vector, but, 
under spatial inversion, one has 

UblVua' = UblVlVua = -Ub^fV^a (3.55a) 

and 

Ubl^7^Ua' = UblVl^l'^Ua = +Ub7^7^Ua (3.556) 

We have an axial four-vector. Its time component changes sign, while the 
space components do not. 

3.5 Plane Wave Solutions and the Conserved Dirac 
Current 

In this last Subsection we shall find the plane wave solutions of the Dirac 
equation for a noninteracting particle, and, as in the case of the Klein-Gordon 
equation, we shall determine the conserved current. 

At this point the equations become very large and it is necessary to find a 
strategy to simplify the calculations and avoid to lose the physical meaning 
of the developments. For this reason, most textbooks adopt the system of 
units in which 

In any part of the calculations one can go back to the standard units recalling 
the following dimensional equalities 

[h] = [E] [T], [c] = [L] [T]-' 

and use the numerical values given in Subsection 3.1. 
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In this way, Dirac equation (3.36) is written in the form 

[id^Y - m] ^(x) = (3.56) 

Let us make the hypothesis that the wave function il){x) can be factorized in 
plane wave exponential, identical to that of the Klein-Gordon equation given 
in eqs.(3.4a,b), and a Dirac spinor not depending on the four- vector x. Also 
using eq.(3.5a) for positive and negative energy, being A the energy sign, we 
can write 

i^\pa{x) = u{X, p, a) exp [i(-Ae(p)t + pr)] (3.57) 

The spin label a of the Dirac spinor (not to be confused with the Pauli 
matrices) will be discussed in the following. 

Applying the space-time derivative operator to the previous equation one has 

idn-ipxpaix) = (Ae(p), -p)'u(A, p, a) exp [i(-Ae(p)i pr)] (3.58) 

where the the minus sign in — p is due to the use of covariant components of 
the operator id^. 

We insert the last result in the Dirac equation (3.56). Cancelling the expo- 
nential factor, it remains the following matrix equation for the Dirac spinor: 

[Ae(p)7° - (p7) - m]K(A, p, a) = (3.59) 

As in eq.(3.43), we write the four component Dirac spinor in terms of two 
two-component ones: 

.X. 

where </?, x ^^e respectively defined as upper and lower components of the 
spinor. For brevity we do not write the indices A, p, cr in (/? and x- 
Using the 7'^ in the standard representation of eq.(3.47), we can write eq.(3.59) 
in the form: 

(Ae(p) -m)^ - (pa)x = (3.61a) 

(Ae(p)+m)x-(pfT)v^ = (3.616) 

Considering positive energy states, that is A = +1, we obtain the lower 
components x+ terms of (^+ by means of eq. (3.61b): 

e(p) m 

AA 



w(A,p,(7) = 



(3.60) 



In this case it is not possible to write 93+ in terms of x+ using eq.(3.61a) 
because, with A = +1, the factor Ae(p) — m is vanishing for p = 0. 
Conversely, for negative energy states, that is A = —1, from eq. (3.61a) we 
obtain the upper components: 



e(p) + m 



X- 



(3.626) 



In this way we have found the plane wave solutions of Dirac equation for a 

noninteracting particle. The two-component spinors 99+ . X- can be chosen 
(but it is not the only possible choice), as those of the nonrelativistic theory. 
Denoting them as w„, with the property wl^,Wa = Sa'a, one has explicitly 



w+ 



for spin up and down, respectively. 

In consequence the Dirac spinors u{X, p, a) can be put in the form 



and 



u{+l,p,a) = N 



e(p)+m " 



e(p)+m 



(3.63a) 



(3.636) 



We point out that, in general, the spin label a of w„ does not represent the 
spin eigenvalue in a fixed direction, for example the axis. This property 
holds true only for a particle at rest. In this case the previous solutions 
coincide with the solutions of eq.(3.44). 

General properties of spin and angular momentum for Dirac equation will be 
studied in a subsequent work. 

The Dirac spinors of eqs.(3.63a,b) can be also conveniently written as 

m(A, p, (j) = m(A, p)w^ (3.64) 

with 



ii(+l,p) = Ar 



1 

(p'^) 

e(p)+m 



(3.65a) 
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and 



^-l,p) = 7V 



(p'^) 

e(p)+m 
1 



(3.656) 



where the u{X, p) represent 4x2 matrices. They must be apphed onto the 
two-component (column) spinors w^, giving as result the four component 
Dirac (column) spinors of eqs.(3.63a,b). 

Note that, in contrast to the nonrelativistic case, the Dirac spinors depend 
on the momentum of the particle. 

We now discuss the normalization factor A^. In nonrelativistic theory, the 
plane wave of a spin 1/2 particle is "normalized" as 



Wcr exp [i{—Et + pr)] 



where V represents the (macroscopic) volume where the particle stays. The 
probabilty of finding the particle in this volume is set equal to one. However, 
y is a fictitious quantity that always disappears when physical (observable) 
quantities are calculated. In consequence, for the sake of simplicity, one can 
put V — 1. In this way, one has 

A similar result can be obtained for the Dirac equation plane waves, putting 
in eqs.(3.63a)-(3.65b) 



m 



\ 26(p) 



(3.66) 



where nc stands for not covariant . In fact this normalization cannot be 
directly used for the calculation of covariant amplitudes. With this nonco- 
variant normalization, the Dirac wave function satifies the following normal- 
ization equation that is analogous to the nonrelativistic one 



(3.67) 



As an exercise, verify this result and that of eq.(3.69), by using eq.(A.7) for 
the products of (crp). Also use the identity 

= [<P)? -m^ ^ (e(p) + m)(e(p) - m) 
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The covariant normalization is obtained taking 



By using this normaHzation one has 

k(A', p, a')u{X, p, a) = {-lf5xx'Saa' (3.69) 

that, also recaUing eq. (3.52a), represents an exphcitly Lorentz invariant con- 
dition. 

In many textbooks a shghtly different covariant normalization in used, that 
is 

SO that a factor 2m appears in the r.h.s. of eq.(3.69). 

When reading a book or an article for the study of a specific problem, pay 
attention to which normalization is really used ! 

For further developments it is also introduced the spinor corresponding to 
negative energy, negative momentum — p (and spin label a). Prom eq.(3.63b) 
or (3.65b) one has 



u{-l,-p) = N 

Note that 



(p^) 

e(p)+m 
1 



(3.70) 



tz(-l,-p) = -7'tx(+l,p) (3.71) 
That spinor is standardly applied to Wa, as in eqs. (3.63b) and (3.64). 

We conclude this section studying the transition current associated to the 
Dirac equation in the same way as we studied that of the Klein-Gordon 
equation in eqs.(3.6)-(3.8b). 

First, one has to write the Dirac equation for the adjoint wave function 

v;(x) = v^(x)7° 

To this aim, take the Dirac equation (3.56) and calculate the hermitic con- 
jugate. By using eq.(3.39), one finds 

-id^ip^ (x)7°7'^7° -ij^{x)m^Q (3.72a) 
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Multiplying this equation from the right by — 7*^ one obtains 

id^ipix)^'' + ij{x)m = (3.726) 
that is the searched equation. 

As done for the Klein-Gordon equation we obtain the conserved current by 
means of the following three steps. 

(i) Take eq.(3.56) with a plane wave, initial state, solution ipi{x) correspond- 
ing to energy sign A/, three-momentum p/ and spin label (7/. 

(ii) Analogously, take eq.(3.72b) with a plane wave, final state, solution 

(iii) Multiply the equation of step (i) by iPf{x) and the equation of 

step (ii) by ipi{x). Then sum these two equations (note that the scalar mass 
term disappears), obtaining 

dM^) = (3.73) 
where the Dirac conserved current is 

Jf^jix) = Mx)YMx) (3.74a) 

= u{Xf, Pf, aF)Yu{Xi: Pi: CTj) exp(ig^x'') (3.746) 

with the four- momentum tranfer q'^ = pp — p'^. The four- vector character of 
the Dirac current is manifestly shown by the previous equation. 
The Dirac four- vector vertex is 

up-f^ui = u(Xf, Pf, (Jf)i^u{\i, Pi, aj) (3.75) 

Due to current conservation it satisfies, analogously to eq.(3.9), 

q^UFl^ui = (3.76) 

Note that in the static case the current density (differently from the Klein 
Gordon equation) is a positive quantity both for positive and negative enery 
states, as shown explictly by the second equality of the following equation: 

Jn = M^h^'M^) = 4{x)Mx) > (3.77) 

This property allows to attach (for some specific problems) a probabilistic 
interpretation to that quantity and to consider ■0(x) as a wave function in 
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the same sense of nonrelativistic quantum mechanics. However, the presence 
of negative energy solutions requires, in general, the introduction of the field 
theory formalism. 

The vertex of eq.(3.75) at first glance looks very different with respect to 
that of the Klein-Gordon equation (pp+p'j) given in eq.(3.8b). The so-called 
Gordon decomposition, with some algebra on the Dirac matrices, shows that 
it can be written in a form that is more similar to the Klein-Gordon one. 
This procedure will be analyzed in a subsequent work. 
For the moment, using the properties of the Pauli matrices, the reader can 
show that 

u{X, p, a')ru{X, p, a) = ^S,,, (3.78) 

m 

with = (e(p),p). The covariant normahzation of eq.(3.68) has been used. 

We conclude this work noting that, at this point, the reader should be able 
to use the main tools related to Dirac equation, being also familiarized with 
the issues of relativity in quantum mechanical theories. 
More formal details and calculations of physical observables can be found in 
many textbooks and will be studied in a subsequent work. 

4 Appendix. Properties of the Pauli Matri- 
ces 



The three Pauli matrices are defined as follows 



"0 


1' 




-% 




" 1 0" 


.1 


0. 




J 0. 




.0 - 1. 



(Al) 



they are 2x2, traceless, hermitic (cr"^ = cr") matrices. The Pauli matrices 
fulfill the the following commutation rules 

[(7",(7^] =2^^*^^^ (A2) 

One defines the spin, that is the intrinsic angular momentum operator, mul- 
tiplying the (t" by fi/2. 

By means of this definition, the spin satisfies the standard angular momentum 
commutation rules, that are 
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Independently, the Pauli matrices fulfill the anticommutation rules 

Summing up eqs.(A.2) and (A. 3) and dividing by two, one obtains the very 
useful relation 

a^a^ = + te'^'^^^a^ (AA) 
Obviously only two of eqs.(A.2), (A. 3) and (A. 4) are independent. 

Given the three-vectors a and b, let us multiply the previous expression by 
a" and b^, summing over the components. One obtains 

(cra)(crb) = ab + i(aa X b) (^.5) 

Note that (era) represents the following matrix 



(aa) 



a} + ia^ — 



{A.Q) 



and analogously for (crb) and (era x b). 

In eq.(A.5), if b = a, the vector product is vanishing, so that one has 

(aa)2 = a? {A.l) 

Starting from this equality we can calculate the function /(era). 

To this aim we recall that, if a function f{x) of a standard variable x has the 

Taylor expansion 

oo 

f{x) = Y.CnX^ (A8) 

n=0 

the (same) function of the matrix (ca) is defined as follows 

oo 

/(aa) = ^c„(aa)" (A9) 

n=0 

The result is obviously a 2 x 2 matrix. 

Incidentally, the previous definition, that makes use of the Taylor expansion 
in powers of the argument matrix, is a general one: it holds not only for (era) 



50 



but also if the argument of the function is a matrix of any dimension or if it 
is a Unear operator. In the present case, the powers (aa)" in eq.(A.9) can be 
calculated by means of eq.(A.7). We also use (o"a)° = 1. 
We make here some algebraic developments to obtain a "closed" expression 
for eq.(A.9). 

First. let us write separately the even and the odd powers in the expansion 
(A.8): 

oo oo 

f{x) = E ^^mX'"^ + E C2^+lX^'+^ (A.10) 
m=0 1=0 

Do the same for /(— x): 

oo oo 

n-x) = E C2n.a;'" - Y.^2l+lX^'^^ {A.ll) 
m=0 1=0 

So that, summing and subtracting the last two equations, one has: 

-1 oo 

^[f{x)+f{-x)]=Y.C2mX''^ (A12) 
^ m=0 

-I oo 

-[f{x)-f{-x)] = Y.C2i+ix''''' (A13) 

^ 1=0 

Let us now go back to eq.(A.9), introducing the unit vector a and the absolute 
value {positive !) |a|, by means of the standard relation 

a = |a|a (^-14) 
Furthermore, by means of eq.(A.7), one has 

(aa)'"^ = (a')"^ = |a|'"^ (A15) 

(aa)2'+i = (aa)(a2)' = (cTa)|a|2^+i (A.16) 

In consequence, writing separately the even and odd powers in eq.(A.9), and 
using eqs.(A.12,13), one obtains 

oo oo 

/(aa) = <^2m\^?'^ + (aa)Ec2z+i|a|2'+i = 

m=0 1=0 

= ^[/(|a|) + /(-|a|)] + ^[/(|a|) - /(-|a|)](aa) (A17) 
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In order to derive the second equality of eq.(3.17b), being the , defined in 
eq.(3.11) as block diagonal matrices, one can procede separately for the two 
blocks. Let us consider first the upper left block. By means of the previous 
equation, one has 



exp 



(TV 

2^ ' 



exp I 



+ exp(-|-|) 



exp(l-l) -exp(- 



00 



{A.18) 

Also, we have used 



In the previous equation sgn{u}) gives the sign of uj. 
B.— —sgn{u;)v and |a| = in eq.(A.17). 

As for the lower right block, one easily obtain the result that is analogous to 
the previous one, but with a plus sign in front of the second term. Recalling 
the form of the matrices, one obtains the final result of eq.(3.17b). 



The reader can now look at this development in a slightly different way. 
Recalling the form of the of eq.(3.11), the powers of (aa), satisfy the same 
relations as eqs.(A.15) and (A. 16) for the powers of (era). In consequence, 
one can repeat the calculations of eqs.(A.17) and (A.18) simply replacing the 

with the a^, obtaining directly eq.(3.17b). 
Furthermore, in this way, one realizes that the result remains the same also 
in the standard representation and only depends on the anticommutation 
rules of the matrices. 
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